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Abstract 

Transmission of information over a discrete-time memoryless Rician fading channel is con- 
sidered where neither the receiver nor the transmitter knows the fading coefficients. First the 
structure of the capacity-achieving input signals is investigated when the input is constrained 
to have limited peakedness by imposing either a fourth moment or a peak constraint. When 
the input is subject to second and fourth moment limitations, it is shown that the capacity- 
achieving input amplitude distribution is discrete with a finite number of mass points in the 
low-power regime. A similar discrete structure for the optimal amplitude is proven over the 
entire snr range when there is only a peak power constraint. The Rician fading with phase- 
noise channel model, where there is phase uncertainty in the specular component, is analyzed. 
For this model it is shown that, with only an average power constraint, the capacity-achieving 
input amplitude is discrete with a finite number of levels. For the classical average power 
limited Rician fading channel, it is proven that the optimal input amplitude distribution has 
bounded support. 

Index Terms: Fading channels, memoryless fading, Rician fading, phase noise, peak con- 
straints, channel capacity, capacity-achieving input. 
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1 Introduction 



Recently, the information theoretic analysis of fading channels is receiving much attention. This in- 
terest is motivated by the rapid advances in wireless technology and the need to use scarce resources 
such as bandwidth and power as efficiently as possible under severe fading conditions. Providing 
the ultimate performance, information theoretic measures such as capacity, spectral efficiency and 
error exponents can be used as benchmarks to which we can compare the performance of practical 
communication systems. Furthermore with the recent discovery of codes that operate very close to 
the Shannon capacity, information theoretic limits have gained practical relevance. Although the 
capacity and other information-theoretic measures of fading channels were investigated in as early 
as the 1960 's (0], [Hj), it is only recently that many interesting fading channel models have been 
considered under various practically related input and channel constraints. 

A significant amount of effort has been expended to study fading channel models where side 
information about the fading is available at either the receiver or the transmitter or both (see 
[22] 5 |2S]; |21]; [22] )• However, under fast fading conditions noncoherent communications, where 
neither party knows the fading, often becomes the only available alternative. Richters considered 
the problem of communicating over an average power limited discrete-time memoryless Rayleigh 
fading channel without any channel side information. He conjectured that the capacity-achieving 
amplitude distribution is discrete with a finite number of mass points. Recently, Abou-Faycal et al. 
[S] gave a rigorous proof of Richters' conjecture. This result shows that when the fading is known by 
neither the transmitter nor the receiver, the optimal amplitude distribution has a notably different 
character than that of unfaded Gaussian channels. A similar discrete structure for the optimal 
input was also shown in (TU] for the pulse amplitude modulated direct detection photon channel 
when average and peak power limitations are imposed on the intensity of a photon emitting source. 
Katz and Shamai jH] considered the noncoherent AWGN channel and proved that the optimal input 
amplitude is discrete with an infinite number of mass points. Lapidoth fE] recently analyzed the 
effects of phase noise over the AWGN channel, characterizing the high-SNR asymptotics of the 
channel capacity for a general class of phase noise distributions with memory. An extensive study 
of the capacity of multi-antenna fading channels at high SNR was conducted in JH] • 

Kennedy j4j showed that the infinite bandwidth capacity of fading multipath channels is the 
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same as that of the unfaded Gaussian channel. Although any set of orthogonal signals achieves 
this capacity for the unfaded Gaussian channel, orthogonal signals that are peaky both in time and 
frequency are needed in the presence of fading j2Zl Sec. 8.6]. Indeed, for a general class of fading 
channels, Verdu ^ has recently shown that if there are no constraints other than average power, flash 
signaling, a class of unbounded peak-to-average ratio inputs defined in 0, is necessary to achieve 
the capacity as SNR when the channel realization is unknown at the receiver. Flash signaling 
can be practically employed in systems where sudden discharge of energy (e.g., using capacitors) is 
allowed, thus sidestepping the use of RF amplifiers. However, these peaky signals are not feasible 
in communication systems subject to strict peak-to-average ratio requirements. Furthermore, in 
some systems CDMA-type white signals which spread their energy over the available bandwidth 
are used because of their anti-jamming and low probability of intercept capabilities. Hence, it is of 
interest to investigate the effect upon the capacity of imposing peakedness constraints, especially 
in the low-power regime. Medard and Gallager jTTj considered noncoherent broadband fading 
channels with no specular component and limited the peakedness of the input signals by imposing a 
fourth moment constraint. Then they showed that such a constraint forces the mutual information 
to zero inversely with increasing bandwidth. Since CDMA-type signals spread their energy over 
the available bandwidth, they satisfy the above fourth moment constraint. Therefore, Medard 
and Gallager conclude that CDMA-type white signals cannot efficiently utilize fading multipath 
channels at extremely large bandwidth. Other results on this theme are obtained by Telatar and 
Tse [19^ and Subramanian and Hajek jl8j . 

In this paper, we consider the noncoherent discrete-time memoryless Rician fading channel and 
study the capacity and the optimal input structure. In a companion paper [Part II], we further 
investigate the spectral-efficiency/bit-energy tradeoff in the low-power regime. The organization of 
the paper is as follows. In Section|2l we introduce the Rician fading channel model. In SectionlHl we 
characterize the structure of the capacity achieving input distribution in the low-power regime when 
the channel input is constrained to have limited peakedness. In Section HI the average power limited 
Rician fading with phase-noise channel, where there is phase uncertainty in the specular component, 
is introduced and the structure of the capacity-achieving input is investigated. Numerical results 
are given in Sectional while Section IHl contains our conclusions. 
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2 Channel Model 



In this paper, we consider the following discrete-time memoryless Rician fading channel model 

yi = mxi + a-iXi + rii (1) 

where {oi} and {rii} are sequences of independent and identically distributed (i.i.d.) circular zero 
mean complex Gaussian random variables, independent of each other and of the input, with vari- 
ances i?{|ajp} = 7^ and = Nq, m is a deterministic complex constant, Xj is the complex 
channel input and Ui is the complex channel output, {aj} and {rii} represent sequences of fading 
coefficients and background noise samples, respectively. 

The Rician fading channel model is particularly appropriate when there is a direct line-of- 
sight (LOS) component in addition to the faded component arising from multipath propagation. 
Moreover, the Rician model includes both the unfaded Gaussian channel and the Rayleigh fading 
channel as two special cases. Hence, results obtained for this model provide a unifying perspective. 

In the channel model (^, fading is assumed to be flat and hence has a multiplicative effect on 
the channel input. This is a valid assumption if the delay spread of the channel is much smaller than 
the symbol duration. Moreover, frequency selective fading channels can often be decomposed into 
parallel non-interacting flat fading subchannels using orthogonal multicarrier techniques. Note also 
that the fading coefficients assume independent realizations at every symbol period. Under such 
fast fading conditions, reliable estimation of the fading coefficients may be quite difficult because 
of the short duration between independent fades. Therefore, we consider the noncoherent scenario 
where neither the receiver nor the transmitter knows the fading coefficients {aj}. 

3 Channel Capacity and Optimal Input Distribution 

In this section, we elaborate on the structure of the capacity achieving input distribution for the 
Rician fading channel when the input has limited peakedness which is achieved by imposing a fourth 
moment or a peak limitation on the input amplitude. 
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3.1 Second and Fourth Moment Limited Input 

We first assume that the input amplitude is subject to second and fourth moment constraints: 



E{\Xi\^} < Pav (2) 

E{\x,\'] < kPI Vz (3) 

where Pav is the average power constraint and 1 < k < oo. When the average power constraint is 
active, the fourth moment constraint is equivalent to limiting the kurtosis, \ ' :2 < which is a 
measure of the peakedness of the input signal. For the Rician channel model the capacity is the 
supremum of the input-output mutual information over the set of all input distributions satisfying 
the constraints (j21) and ^ 

C= sup / / fyyM^) log%^ dy dF.(x) (4) 
F,{-) Jc Jc Jy{y) 

E{\x\^}<Pav 
i?{|x|4}<KP2^ 

where the conditional density of the output given the input, 



is circular complex Gaussian. Moreover note that fy{y) = fy\x{y\x) dF^i^x) is the marginal output 
density, is the distribution function of the input and C denotes the complex plane. 
First we have the following preliminary result on the optimal phase distribution. 

Proposition 1 For the Rician fading channel (OP and input constraints (0j and (0), uniformly 
distributed phase that is independent of the input amplitude is capacity- achieving . 

Proof: The result follows readily from the arguments in Sec. IV.D.6] where the optimality of 
circularly symmetric input distributions is pointed out in a more general setting. Assume that an 
input random variable x generates a mutual information, Jq. Consider a new input xi = xe^^ where 

6 is independent of x and uniformly distributed on (— vr, vr]. Since the conditional distribution has 

"'^Since the channel is memoryless, without loss of generality we can drop the index i. 

^The result holds in wider generality: the feasible set defined by lO and © can be replaced by any set of 
constraints that are imposed only on the input magnitude. 
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the property fy\x{.^'^^yW^x) = fy\x{y\x) for any 9, it can be easily seen that mutual information 
is invariant under deterministic rotations of the input distribution and hence I{xi;y\6) = Iq. By 
the concavity of the mutual information over input distributions, we have I{xi]y) > Iq- Further 
note that input constraints Q and (jS)) are also invariant to rotation. Therefore there is no loss in 
optimality in considering input random variables that has uniformly distributed phase independent 
of the amplitude. □ 

Note that if |m| = 0, we have a Rayleigh fading channel where the phase cannot be used to 
convey information when the channel is unknown. However, if there is a line-of-sight component, 
i.e., \m\ > 0, phase can indeed carry information and by Proposition ^ the uniform distribution 
maximizes the transmission rate. 

With this characterization, we have reduced the optimization problem (j3)) to optimal selection 
of the distribution function of the input amplitude, F\x\{-), under the constraints < Pav 

and < /t-P^i,. For the sake of simplification in the notation, we introduce new random 

variables R = and r = Assuming a uniform input phase that is independent of the 

input magnitude, the mutual information in nats, after a straightforward transformation, can be 
expressed as follows: 

poo poc 

liFr) = I{x- y) = - fniR; F,) In /^(i?; F,) dR- / ln(l + r^) dF,,(r) - 1 (6) 
^0 Jo 

where fniR] F^) = g{R, r) dFr(r) is the density function of R with the kernel given by g{R, r) = 
j^exp ^— -^Y+^j -^0 (^ ^^'^2^ j • Furthermore, Fr is the distribution function of r and K = is 
the Rician factor. Now, the capacity formula can be recast as follows 

C(a, K, K) = sup I{Fr) (7) 

Fr 

E{r^}<a 
E{r*}<K,a^ 

depending on three parameters, namely, a = 7^-^, the normalized SNR; k; and K, the Rician 
factor. In pp, the existence of an optimal amplitude distribution achieving the supremum in ^ 
is shown and the following sufficient and necessary condition for an amplitude distribution to be 
optimal is derived employing the techniques used in |H]. 
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Proposition 2 fKuhn- Tucker Condition ) For the Rician channel 0) with input constraints |^ 
and (0), -^0 is a capacity achieving amplitude distribution if and only if there exist Ai, A2 > such 
that the following is satisfied 



g{R,r)\nfR{R,Fo)dR + \n{l + r^)+Xi{r^-a) + X2{r^-Ka^)+C + l>0 Vr > (8) 



with equality if r E Eq where Eq is the set of points of increase^ of Fq. 

Note that in the above formulation, Ai and A2 are the Lagrange multipliers for the second and 
fourth moment constraints respectively. Using Proposition |21 we have the following result on the 
optimal amplitude distribution. 

Theorem 1 For the Rician fading channel ^ with input amplitude constraints ^ and if the 
fourth moment constraint ^ is active then the capacity- achieving input amplitude distribution is 
discrete with a finite number of mass points. 

Proof: The result is shown by contradiction. The proof can be summarized as follows: 

i) We first contradict the assumption that the optimal distribution has an infinite number of 
points of increase on a bounded interval. 

ii) Next, we contradict the assumption that the optimal distribution has an infinite number of 
points of increase (mass points) but only finitely many of them on any bounded interval. 

iii) Ruling out the above assumptions leaves us with the only possibility that the optimal input 
has a finite number of mass points. 

Assume Fq is an optimal amplitude distribution. To prove the theorem, we first find a lower 
bound on the left hand side (LHS) of the Kuhn- Tucker condition (jH)). To that end, we first bound 
fji as follows. 





(10) 



(9) 





■^The set of points of increase of a distribution function F is {r : F{r — e) < F{r + e) Ve > 0}. 
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where < Dp^ < I is a constant depending on Fq. The first inequahty is obtained from the fact that 
Io{x) > 1 Vx > and the second inequahty comes from observing that e~^^''^~^^''^ > \/R,r > 0. 
Using the lower bound and noting that g{R,r) is a noncentral chi-square density function in 
R, we have the foUowing bound on the LHS of the Kuhn- Tucker condition (jH)) 

LHS > lnD^„ - 1 - (1 + Ky + ln(l + r^) + Xi^ - a) + A2(r^ - na'^) + 1 + C Vr > 0. 

If the fourth moment constraint is active, i.e., A2 > 0, then for ah Dpg > and Ai > 0, the 
above lower bound diverges to infinity as r — 00. Now, we establish contradiction in the following 
assumptions. 

i) Assume that the optimal distribution Fq has an infinite number of points of increase on a bounded 
interval. Note that this assumption is satisfied by continuous distributions. First we extend the 
LHS of the Kuhn- Tucker condition (jHl) to the complex domain. 

POO 

^{z)= I g{R,z)\iifR{R,Fo)dR + \Ti{l + z'^) + \i{z^ -a) + \2{z* -Ka^)+C + 1 (12) 

where 2; G C. By the Differentiation Lemma ^T!, Ch. 12], it is easy to see that $ is analytic in 
the region where Re{l + -z^} > [H Appendix C]. This choice of region guarantees the uniform 
convergence of the integral expression in (jl2j) . Note that in this region, by our earlier assumption, 
$(2;) = for an infinite number of points having a limit point^. By the Identity Theorem^, ^{z) = 
in the region where Re{l + z'^} > 0. Hence the Kuhn- Tucker condition (jSJ is satisfied with equality 
for all r > 0. Clearly, this case is not possible from the above lower bound which diverges to infinity 
as r — > 00. 

ii) Next assume that the optimal distribution has an infinite number of mass points but only finitely 
many of them on any bounded interval. Then the LHS of the Kuhn- Tucker condition should be 
equal to zero infinitely often as r — > 00 which is again not possible by the above diverging lower 
bound. 

Hence the optimal distribution must be discrete with a finite number of mass points and the theorem 

""^The Bolzano- Weierstrass Theorem [2^1 states that every bounded infinite set of real numbers has a limit point. 

^The Identity Theorem for analytic functions 'SH' states that if two functions are analytic in a region TZ, and if 
they coincide in a neighborhood, however small, of a point zq of 7?,, or only along a path segment, however small, 
terminating in zq, or also only for an infinite number of distinct points with the limit point zq, then the two functions 
are equal everywhere in TZ. 
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follows. □ 
The significance of Theorem ^ comes from the fact that for the Rician fading channel, any fourth 
moment constraint with a finite n will eventually be active for sufficiently small SNR because, as 
observed in Sec. V.E], if there is no such constraint, the required value of k grows without 
bound as SNR 0. Therefore, Theorem ^ establishes the discrete nature of the optimal input in 
the low-power regime. Furthermore, Theorem ^ easily specializes to the Rayleigh and the unfaded 
Gaussian channels. For the unfaded Gaussian channel, if the fourth moment constraint is inactive, it 
is well known that a Rayleigh distributed amplitude is optimal and it has kurtosis k = 2. Therefore 
the fourth moment constraint being active (i.e., 1 < k < 2) is also a necessary condition for the 
discrete nature in that case. In jH], the optimal amplitude for the average-power-limited noncoherent 
Rayleigh fading channel is shown to be discrete with a finite number of levels over the entire SNR 
range. Theorem Q proves that this discrete character does not change when we have an additional 
fourth moment constraint. 

We note that the key property which leads to the proof of Theorem ^ is that we have a moment 
constraint higher than the second moment. Hence the result of Theorem ^ holds in a more general 
setting where the fourth moment constraint Q is replaced by a constraint in the following form: 
-E{|a;p+''} < M for some 5 > and M < oo. In a related work, Palanki 21\ has independently 
shown the discrete character of the optimal input for a general type of fading channels when only 
moment constraints strictly higher than the second moment are imposed. 

3.2 Peak Power Limited Input 

In this section, we assume that the input amplitude is subject to only a peak power limitation, 

|xi|2<'P Vi. (13) 

Although being more stringent than the fourth moment limitation, peak power constraint is more 
relevant in practical systems. For instance, efficient use of battery power in portable radio units 
and linear operation of RF amplifiers employed at the transmitter require peak power limited 
communication schemes. 

Since the peak constraint (fT^ is invariant to rotation of the input, optimality of uniform phase 
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follows from Proposition ^ Hence, our primary focus is on obtaining a characterization for the 
optimal amplitude distribution. Existence of a capacity-achieving amplitude distribution readily 
follows from the results on the second and fourth moment limited case. Similarly, we define R = 
■^\y\'^ and r = Specializing (jSI), we easily obtain the following sufficient and necessary 

condition for an amplitude distribution, Fq, to be optimal over the peak-power limited Rician 
channel: 

POO 

/ giR,r)\nfR{R,Fo)dR + \n{l+r^)+C + l>0 Vr G [O, (14) 
Jo 

with equality if r G -Eq where Eq is the set of points of increase of Fq. Note that a = 7^;^- Next, 
we state the main result on the optimal amplitude distribution. 

Theorem 2 For the Rician fading channel (OP where the input is subject to only a peak power 

a.s. 

constraint |xp < P, the capacity- achieving amplitude distribution is discrete with a finite number 
of mass points. 

Proof: Since the input is subject to a peak constraint, the result in this case is established by 
contradicting the assumption that the optimal input distribution has an infinite number of points 
of increase on a bounded interval. 

Assume Fq is an optimal distribution. To prove the theorem, we first find an upper bound on 
the left-hand-side (LHS) of (fT^ . To achieve this goal, we bound /ij(-,-Fo) as follows. 

fRiR;Fo) = I YTT^'^^'P V J ° 1 







< exp ( + v^) frh^-V (-^ ) dFo(r) (15) 



= Dp, exp (^- + ^) (16) 

where < Di^o < 1 is a constant depending on Fq. Upper bound (fT^ is easily verified by observing 
exp {-j^) < exp (-^) Vr < and Jq (^^g^) < Io{VKR) < exp(v^) Vr > 0. Using 
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(fTH|l . we have the following upper bound: 

roo -| POO poo 

/ g{R,r)\nfn{R;Fo)dR<\nDF^--— g{R,r)R dR + ^/K g{R,r)VRdR (17) 
Jo 1 + a Jo Jo 

<\nDF,- ^^ ^ + VK^I + (1 + Ky Vr>0. (18) 
1 + a 

The upper bound in ()18|) follows from the fact that g{R,r) is a non-central chi-square probability 
density function in R, and g{R, r)R dR = 1 + (1 + K)r2 and g{R, r)/R di? < + + K)r^ 
which follows from the concavity of ^/x and the Jensen's inequality. From (|TH)l . we obtain the 
following upper bound on the left hand side (LHS) of (fT^ : 

LHS < InDp, _ ^ + + + + (1 + Ky + ln(l + r^) + C + 1 Vr > 0. (19) 

1 + a 

Using the above upper bound, we show that the following assumption cannot hold true. 

i) Assume that the optimal input distribution Fq has an infinite number of points of increase on a 

bounded interval. Next we extend the LHS of (|14p to the complex domain: 

POO 

^{z)= g{R, z) In /«(i?, Fq) dR + ln(l +z^) + C + l (20) 
Jo 

where z G C. Since the condition in (|14j) should be satisfied with equality at the points of increase 
of the optimal input distribution, by the above assumption, ^{z) = for an infinite number of 
points having a limit point. Then by the Identity Theorem (SU], ^{z) = in the whole region where 
it is analytic. By the Differentiation Lemma [211 Ch. 12], one can easily verify that ^{z) is analytic 
in the region where Re(l + 2;^) > which includes the positive real line. Therefore we conclude that 
^(r) = Vr > 0. Clearly, this is not possible from the upper bound in ()19|) which diverges to —00 
as r 00 for any finite a, K > 0, and Df,^ > 0. 

Reaching a contradiction, we conclude that the optimal distribution must be discrete with a finite 
number of mass points. □ 
We note that Theorem |21 establishes the discrete structure of the optimal input distribution 
over the entire SNR range. The proof basically uses the observation that a bounded input induces 
an output probability density function that decays at least exponentially, and in turn provides a 



10 



diverging bound on the Kuhn- Tucker condition. We also note recent independent work by Huang 
and Meyn ^3] , where the discrete nature of the optimal input is proven by again showing a diverging 
bound on the Kuhn- Tucker condition for a general class of channels in which the input is subject 
only to peak amplitude constraints. 

4 Rician Fading Channel with Phase Noise 

In this section, we deviate from the classical Rician channel model ^ where the specular component 
is assumed to be static and consider the following model 

yi = {ai + me^'^')xi + ni. (21) 

where phase noise is introduced in the specular component. Here, {6i} is assumed to be a sequence 
of independent and identically distributed uniform random variables on [— vr, vr) and m is a deter- 
ministic complex constant. We again consider the noncoherent scenario where {aj} and {6i} are 
known by neither the receiver nor the transmitter. This model is relevant in mobile systems where 
rapid random changes in the phase of the specular component are not tracked. Moreover, such 
a model is suitable in cases where there is imperfect receiver side information about the fading 
magnitude. As another departure from the previous section, here we impose only an average power 
constraint, < Pav The discrete nature of the optimal input amplitude follows immediately 

from the techniques of Section El when there is an additional higher moment constraint. 

We immediately realize that the channel output, y, is conditionally Gaussian given x and 9, 



1 I \y ~ rne^^x 



2' 



fy\x,e{y\x,e) = —^--^-——exp \ 2\ \2 , AT • (22) 

Integrating ()22p over uniform 6, we obtain the conditional distribution of the channel output given 
the input, 

Jy\x{y\x) = ^^^^TTrexp Jo 21 12 i AT ' ^3) 

n[Y\x\ + Nq) \ Y\x\ + J \Y\x\'^ + NqJ 

We again introduce the following random variables: R = -^lyl"^ , f = Since the phase 

information is completely destroyed in the channel (PT|) and the above transformations are one-to- 
one, we have I{x;y) = I{\x\; \y\) = I{r;R). Furthermore the conditional distribution of R given r 
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is easily obtained from (^Hj): 

fMR\r) = exp ^-^^ j /o (^^^ j (24) 

where K = is the Rician factor. Similarly as in the previous section, the existence of an optimal 
amplitude distribution is shown and the following sufficient and necessary condition is given in pp. 

Proposition 3 fKuhn- Tucker Condition ) For the Rician channel model \21\] with an average power 
constraint E {\x\'^} < Pav, Fq is a capacity- achieving amplitude distribution if and only if there exists 
A > such that the following is satisfied 

-D{fR\r\\fR) + Kr^-a) + C>Q Vr>0 (25) 



with equality if r ^ Eq where Eq is the set of points of increase of Fq. In the above formulation, 
D{-\\-) is the divergence (e.g. l&S, Section 2.3]), fR{R;Fo) = Jq°° /^|j.(i?|r) dFo(r) is the density 
function of R, a = 'y'^SNR, and C is the capacity. 

The next theorem gives the main result on the structure of the optimal input for the Rician fading 
channel with phase noise. 

Theorem 3 For the Rician fading channel with uniform phase noise \21]) and average power con- 
straint < P, the capacity- achieving input amplitude distribution is discrete with a finite 
number of mass points. 

Proof: The result is shown by contradiction. Let Fq be an optimal amplitude distribution. The 
proof can be summarized as follows: 

i) We first assume that Fq has an infinite number of points of increase on a bounded interval. 
The impossibility of this case is shown by contravening the fact that under this assumption the 
left hand side of the Kuhn- Tucker condition which is extended to the complex domain is identically 
zero over its region of analyticity. 

ii) Then we assume that the optimal distribution is discrete with an infinite number of mass 
points but only finitely many of them on any bounded interval. This assumption is also ruled out 
by finding a diverging lower bound on the left hand side of the Kuhn- Tucker condition. 
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iii) Having eliminated the above assumptions, we are left with the only possibility that the 
optimal distribution is discrete with a finite number of mass points. 

Assumption 1: Assume that Fq has an infinite number of points of increase on a bounded 
interval. Then the Kuhn- Tucker condition ()25|) is satisfied with equality at an infinite number of 
points having a limit point. First we extend the left hand side of (j25|) to the complex domain: 
\E'(z) = —D{fji\r=z\\fR) + -^(2;^ — a) + C, where z E C Equivalently, using the fact that ffi\r is a 
noncentral chi-square density function in R, 



^{z) = j^^ fR\r{R\z) In fn{R) dR- fR\r{R\z) In [^0 y ^^^ jj 

2K^^ 

+\{z^ -a) + \n{l + z^) + Y^ + C + 1. (26) 

By the Differentiation Lemma 31, Ch. 12], it is easy to see that \& is analytic in the region where 
Re{l + z^} > and Re{z} > 0. The first condition guarantees the uniform convergence of the 
integrals in ()2fi|) by forcing the integrands to decrease exponentially. Since Iq has zeros on the 
imaginary axis and the second integral in ()26|) involves the logarithm of the Bessel function, with 
the second condition, Re{2;} > 0, we exclude the imaginary axis from the region of analyticity. 
Since "^{z) = for an infinite number of points having a limit point, by the Identity Theorem |3(Jj . 
"^{z) = in the whole region where it is analytic. In particular, we have \E'(j6+ = for all |6| < 1 
and n G and hence lim„^oo ^(j^ ~^ ~ ^ ^1^1 ^ ^- terms other than the second term 

in ()2fi|l are analytic also on the imaginary axis with < 1 and the limiting expression is obtained 
by letting ^ — > in the arguments of these functions. For the second term in ()2fij) . we need to 
invoke the Dominated Convergence Theorem (2^ to justify the interchange of limit and integral. 
An integrable upper bound on the magnitude of the integrand of the second integral is shown in 
Appendix D]. Therefore, we have 

lim ^{jb+-) = r fR\r{Rm\nfR{R)dR 

n^oo n Jq 

+A(-62-a)+ln(l-62)-- ^ + C + 1 = V|6| < 1. (27) 

1 — 0^ 
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Note that all the terms other than the second term in ()27p are real. Next we show that the second 
term in (|27|) has a nonzero imaginary component yielding a contradiction. First we evaluate the 
limit in the integrand as follows. 



lim fR\r{R 



In Jo' ^ 



n 



n2 



if lo (^W^) = 

^exp Jo (^) In (jo (^)) otherwise, 

which is obtained easily by observing that all the terms are analytic in the entire complex plane 
(excluding z = ±j) except the logarithm function which is not analytic at the zeros of the Bessel 
function. However, as we approach the zeros of the Bessel function, Jq In Jq and hence we 
obtain (j2HI)- Noting that Jo{z) = Io{jz) where Jo is the zeroth order Bessel function of the first 
kind, and that the set of zeros of the Jo function along the imaginary axis has measure zero, the 
second integral in ()27|) can now be expressed as follows: 

exp — Jo ^ In Jo ^ dJ?. (29) 



1 - 62 \ 1 _ 52 y - ^ 1 _ ^2 j-y^y i-b^ 

By applying a change of variables v = ^^^^ and expressing Inz = \n\z\ + jarg(2;), the above 
integral becomes 

/ i_52 







f exp I — + - — '—J ) Jo(f) In I Jo('y)| dv 



2K62 ^ \^ 4K62 '1-62 

In the above formulation h{v) = if f G (0,ai) and h{v) = k if v E {ak,ak+i) where {ctk} 
are the zeros of Jq. As noted in [T2], the second term in arises due to the fact that the 
logarithm jumps in value by jir when a zero is passed. This fact can be observed by considering 
the argument of Jo (6 — je) as e ^ where b — je is in a small neighborhood of ak such that 
Jo(6 - je) ^ jQ{ak){b-je - ak). 
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Using a similar bound obtained in [12], we have 
~ 1 - 62 / 1 - 62 







2K62 ^ V 4K62 



> 



,,,,,,, 2K62 \ / f3\i-b^)\ / 4K62 \ / a^l - b' , , 
> l>^o(/9)| - -57^ exp -^i^—-^ " U + 7^ exp (31) 



/3(1 - 62); ^ V 4K62 y 7ra2(l - 62) ; ^ V 4K62 

where 02 is the second smallest zero of Jo on the positive axis and (3 is the positive zero of Ji less than 
a2- As noted in |12j|, the above lower bound is positive for small enough values of 6. In particular, 
for each K > 0, the above lower bound is 9.16 x 10^^ when 62 = ^ityi ^ 

Assumption 2: Next we assume that the optimal distribution has an infinite number of mass 
points but only finitely many of them on any bounded interval. Following an approach similar to 
the one used in JH], we first bound fn as follows 

fR{R;Fo) = / h\r{R\r)dFo{r) = J^Pi fR\r{R\ri) (32) 
"^0 i=0 



>P^fR\riR\r^) (33) 

1 / R + Kr r 

1 + r 2 V 1 + 



^ Pi ^^exp ( — ^ J ) Wi WR> 0. (34) 



where pi and rj are the probability and location, respectively, of the z*^ mass point of Fq. We obtain 
the last inequality by using the fact that Jo(x) > 1 Vx > 0. Noting that fii\r{R\r) is a noncentral 
chi square density function in R, this bound leads to the following lower bound on the left hand 
side of (ES) 

2Kr2 

+ A(r2 - a) + ln(l + r^) + + C + 1 , Vi Vr > 0. (35) 

1 + r"' 



Noting that Jo < exp (^^^^) , we have 



1 + 
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Since we have assumed that the optimal distribution has an infinite number of mass points with 
finitely many of them on any bounded interval, we see that for any A > 0, we can choose an rj 
sufficiently large such that the lower bound (jH^j) diverges to cxd as r — ^ oo. But by our assumption 
the left hand side of (jHSjl should be equal to zero infinitely often as r — * oo, which is a contradiction. 
A = implies that the power constraint is ineffective. The impossibility of this case is shown in [S]. 
Therefore the theorem follows. □ 
Recent results [H] and jH] have shown the discrete nature of the optimal distribution for the 
two special cases of the model (j2H): the Rayleigh fading channel and the noncoherent AWGN 
channel. We have proven the discreteness of the capacity-achieving distribution in a unifying setting 
where there is both multipath fading and a specular component with random phase. For the 
noncoherent AWGN channel, Katz and Shamai ^ have shown that the optimal input has an 
infinite number of mass points. An interesting conclusion of Theorem IHl is that the presence of an 
unknown multipath component induces an optimal distribution with a finite number mass points. 
It is also of interest to consider the classical average-power-limited Rician fading channel for 
which tight upper and lower bounds on the capacity were derived in JH]- By Proposition^ we know 
that uniform phase is optimal for this model. Moreover, we have the following partial result on 
the optimal amplitude distribution which proves the suboptimality of input amplitude distributions 
with unbounded support such as the Rayleigh distribution. 

Theorem 4 For the Rician fading channel (OP with only an average power limitation E{\x\'^} < 
Pav, the optimal input amplitude distribution has bounded support. 

Proof: Assume Fq is an optimal distribution. We will prove the proposition by contradiction. So 
we further assume that Fq has unbounded support. With this assumption, for any finite M > 0, 




(36) 



(37) 



(38) 



-DFo,Mexp( 



R 



Vi? > VM > , 



(39) 



1 + M2 
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where < Dp^^ M < 1 VM > and VFq. Using we obtain the following lower bound on the 
left hand side of the Kuhn- Tucker condition^ 



LHS > InD 



1 



1 + K 
1 + M2 



+ ln(l + r^) + \{r^ - a) + 1 + C , Vr > VM > 0. 



Fo,M — 



1 + M2 



For any A > 0^ and Dp^^ M > 0, we can choose M sufficiently large such that the above lower 
bound diverges to infinity as r ^ oo. However, if the optimal input has unbounded support, the 
LHS of the Kuhn- Tucker condition should be zero infinitely often as r — > oo. This constitutes a 



5 Numerical Results 

In general, the number of mass points of the optimal discrete distribution and their locations and 
probabilities depend on the SNR. Analytical expressions for the capacity and the optimal distribution 
as a function of SNR are unlikely to be feasible. Therefore, we resort to numerical methods to 
examine this behavior. The numerical algorithm used here is similar to the ones employed in jB] 
and [F . In particular, we start with a sufficiently small SNR and maximize the mutual information 
over the set of two-mass-point discrete distributions satisfying the input constraints. Then we 
test the maximizing two-mass-point discrete distribution with the Kuhn- Tucker condition. If this 
distribution satisfies the necessary and sufficient Kuhn- Tucker condition, then it is optimal and the 
mutual information achieved by it is the capacity. As we increase the SNR, the required number of 
mass points monotonically increases, and therefore to obtain the optimum distribution we repeat 
the same procedure for discrete distributions with increasing numbers of mass-points.^ 

For the Rician fading channel (K > 0) with second and fourth moment input constraints, nu- 
merical results indicate that for sufficiently small SNR values, the two-mass-point discrete amphtude 
distribution 



^The Kuhn- Tucker condition for the average-power-hmited Rician case is essentially the same as with A2 = 0. 
^The impossibility of A = is shown in 

^We note that the results in this section are obtained numerically, and hence there is no analytical claim of 



contradiction and hence the theorem follows. 



□ 




(40) 



optimality. 
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is optimal. Note that this distribution does not depend on the Rician factor K. Figure [T] plots 
the left hand side of the Kuhn- Tucker condition (jSJ as a function of r for the distribution F{r) = 
0.9u{r) + 0.1u(r — for the Rician fading channel (K = 1) with a = 0.05 and k = 10. From 

the figure we see that the Kuhn- Tucker condition is satisfied and the optimal distribution is in 
the form given by (jin|) . Figures El and 01 plot the magnitude and the probability of the nonzero 
amplitude respectively as a function of SNR (A^o = 1) for various values of n. We immediately notice 
the significant impact of imposing a fourth moment constraint. When there is no such constraint, 
the nonzero amplitude migrates away from the origin as SNR —>■ while its probability decreases 
sufficiently fast to satisfy the average power constraint. This type of input is called flash signaling 
in However, as we see from Figures |21 and El if there is a fourth moment constraint with a finite 
K, then the behavior is quite different. The nonzero amplitude approaches the origin as SNR 
while its probability is kept constant. In the Rayleigh channel (K = 0), is still optimal at low 
SNR up to a point after which, as SNR is further lowered, the second moment constraint becomes 
inactive and we observe that the nonzero mass point approaches the origin more slowly while its 
probability decreases. From Fig. |31 which plots the capacity curves as a function SNR for various 
values of k in the low-power regime, we see that all the curves have the same first derivative at 
zero SNR. This may suggest that performance in the low-power regime is similar for any finite 
value of K. However, as we shall see in the picture radically changes when we investigate the 
spectral-efficiency/bit-energy tradeoff. 

For the peak-power limited Rician fading channel (K > 0), numerical results indicate that for 
sufficiently low SNR values, the optimal amplitude distribution has a single mass at the peak level 
y/P and hence all the information is carried on the uniform phase. For the Rayleigh channel (K = 0), 
an equiprobable two-mass-point distribution where one mass is at the origin and the other mass at 
the peak level is capacity-achieving in the low-power regime. Fig. El plots the capacity curves for 
the peak-power limited Rayleigh channel and Rician channels with K = 1, 2 as a function of the 
peak SNR. Note that the Rayleigh channel capacity curve has a zero slope at zero snr. 

For the Rician fading channel with phase noise (PT|) . numerical results illustrate again that a 
two-mass-point discrete distribution is optimal for sufficiently small SNR values. Figures IHl and U\ 
plot the magnitude and probability, respectively, of the optimal nonzero amplitude for this channel 
with Rician factors K = 0, 1,2. Note that only an average power constraint is imposed here. We 
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observe that flash-signaling-type optimal input, where the nonzero amplitude migrates away from 
the origin as SNR while its probability is decreasing, is required in the low-power regime. For 
fixed SNR, we also see that the nonzero amplitude is closer to the origin for higher Rician factors 
K. Finally Fig. |H1 provides the capacity curves as a function of SNR for Rician factors K = 0, 1, 2. 

6 Conclusion 

In this paper, we have analyzed the structure of the capacity-achieving input for the noncoherent 
Rician fading channel. We have limited the peakiness of the input by imposing a fourth moment 
or a peak constraint. Using a sufficient and necessary condition, we have proven that when the 
input is subject to second and fourth moment limitations, the optimal input amplitude is discrete 
with a finite number of levels in the low-power regime. It turns out that a particular two-mass 
point distribution that depends only on the SNR and k is asymptotically optimal as SNR — 0. 
Discreteness of the optimal input amplitude distribution has also been shown for the peak-power 
limited Rician channel over the entire SNR range. This time, the amplitude distribution with a 
single mass at the peak level is optimal in the low-power regime for the Rician channel with K > 0. 

We also have analyzed a Rician fading channel model where there is phase noise in the specular 
component. We have shown that under an average power limitation, the optimal amplitude is 
discrete with a finite number of levels. For this model, we have provided numerical results for the 
capacity and the optimal input distribution where we observed that a fiash-signaling-type input 
is required in the low-power regime. We have also proved that the optimal input for the average- 
power-limited classical Rician channel has bounded support. 
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Figure 1: The Kuhn-Tucker condition for K = 1, a = 0.05 and k — 10. F{r) — 0.9 u{r) + 0.1 u{r 
and C = 0.0531. Ai = 0.89106, A2 = 0.15135 
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Figure 3: Probability of the second mass point vs. normalized SNR = in the Rician channel 
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Figure 4: Capacity (in nats) vs. normalized SNR = 7^-^ for the Rician channel (K = 1) subject to 
fourth moment constraints with k = 2, 4, 5, 10, 50, 100. 



23 




2 4 6 8 10 12 14 16 18 20 

peak SNR 



Figure 5: Capacity curves as a function of tlie normalized peak SNR = 7^;^ for tiic peak-power 
limited Rayleigh channel (K = 0) and Rician channels with K — 1,2. 
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Figure 6: Magnitude of the nonzero amplitude vs. normalized SNR — 7^"^ for the Rician channel 
with phase noise K = 0, 1,2. 
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Figure 7: Probability of the nonzero amplitude vs. normalized SNR = 7^-^ for the Rician channel 
with phase noise K = 0, 1, 2. 
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Figure 8: Capacity curves as a function of the normalized SNR = for the Rician channel with 

phase noise with Rician factors K = 0, 1, 2. The dashed segments are interpolated capacity curves. 
Numerical optimization methods do not provide stable results in these regions where a new mass 
point is emerging with a very small probability. 
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